Stanford 
cepa 


ine CEPA Working Paper No. 15-20 


Building Teacher Teams: Evidence of Positive Spillovers 
from More Effective Colleagues 


AUTHORS 


Min Sun 
University of Washington 


Susanna Loeb 
Stanford University 


Jason Grissom 
Vanderbilt University 


ABSTRACT 


Student peer effects are well documented. We know far less, however, about peer effects 
among teachers. We hypothesize that a relatively effective teacher may positively affect the 
performance of their peers, while a relatively ineffective teacher may negatively impact the 
performance of other teachers with whom they work closely. Utilizing a decade of data on 
teacher transfers between schools that result in changes of peers when transfer teachers 
enter grade-level team in the new school, we find evidence of strong positive spillover effects 
associated with the introduction of peers who are more effective than the incumbent teacher 
himself or herself. Interestingly, the incumbent teacher's students are not meaningfully 
disadvantaged by the entry of relatively ineffective peers. This finding implies that mixing 
teachers with diverse performance levels can be a strategy for increasing student 
achievement in the aggregate. These results are robust to several student sorting and teacher 


selection issues. 


VERSION 


November 2015 


Suggested citation: Sun, M., Loeb, S., & Grissom, J. (2015). Building Teacher Teams: Evidence of Positive 
Spillovers from More Effective Colleagues (CEPA Working Paper No.15-20). Retrieved from Stanford 
Center for Education Policy Analysis: http://cepa.stanford.edu/wp15-20 


Building Teacher Teams: Evidence of Positive Spillovers from More Effective Colleagues 
Min Sun, Susanna Loeb, Jason Grissom 
Abstract 
Student peer effects are well documented. We know far less, however, about peer effects 

among teachers. We hypothesize that a relatively effective teacher may positively affect the 
performance of their peers, while a relatively ineffective teacher may negatively impact the 
performance of other teachers with whom they work closely. Utilizing a decade of data on 
teacher transfers between schools that result in changes of peers when transfer teachers enter 
grade-level team in the new school, we find evidence of strong positive spillover effects 
associated with the introduction of peers who are more effective than the incumbent teacher 
himself or herself. Interestingly, the incumbent teacher’s students are not meaningfully 
disadvantaged by the entry of relatively ineffective peers. This finding implies that mixing 
teachers with diverse performance levels can be a strategy for increasing student achievement in 


the aggregate. These results are robust to several student sorting and teacher selection issues. 
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Research provides persuasive evidence on teachers’ contributions to student achievement 
(Aaronson, Barrow, & Sander, 2007; Koretz, 2002; McCaffrey, Lockwood, Koretz, Louis, & 
Hamilton, 2004; McCaffrey, Sass, & Lockwood, 2009; Rivkin, Hanushek, & Kain, 2005; 
Rockoff, 2004; Sanders & Rivers, 1996). Studies also suggest that student achievement is a 
function of not just one teacher but of the combined effort of many teachers. For example, the 
change in the quality of a teacher’s colleagues is associated with the change in her or his students’ 
test score gains (Jackson & Bruegmann, 2009). Teachers’ instructional expertise can be diffused 
through professional interactions, and thus change colleagues’ classroom practices (Author, 
2013). Moreover, teachers’ collaboration with one another within teams can increase their 
effectiveness as measured by raising student achievement gains (Author, 2015). Yet the 
documentation of teacher peer effects is sparse and has not spurred wide discussions on the 
strategic grouping of teachers to maximize student learning in the aggregate. 

In 2014, the U.S. Department of Education initiated the 50-state teacher equity strategy to 
call for states and local school districts to increase the equitable distribution of teacher quality 
across schools, particularly to ensure that disadvantaged students are not more likely to be taught 
by unprepared, low-performing teachers. This movement further underscores the discussions 
about how to distribute teacher quality and what the impacts of different ways of re-arranging 
teachers are. As an example, the U.S. Department of Education recently commissioned a study of 
its “Talent Transfer Initiatives” (TTI), an intervention that identified a district’s highest- 
performing teachers and offered them financial incentives for moving to and staying in its low- 
performing schools for at least two years. Although the implementers found it difficult to find 
teachers who were willing to transfer to lower-performing schools, encouragingly TTI had a 


positive impact on math and reading test scores for students in elementary schools, though null 


effects for middle school students (Glazerman, Protik, Teh, Bruch, & Max, 2013). Moreover, 
transferring to disadvantaged schools did not penalize transferring teachers’ own productivity 
(Xu, Ozek, & Corritore, 2012). 

These prior studies of teacher transfers are mainly based on a human capital argument 
that effective teachers contribute to the learning of the students in their own classrooms. 
However, if spillover effects occur, then transfer teachers could also influence school 
productivity beyond their own classrooms. Understanding the spillover and peer dynamics 
among teachers can further inform how to group teachers in a manner that will augment all 
students’ learning. 

This study examines teacher transfers from other schools into existing grade-level teacher 
teams using longitudinal data on teacher transfers between schools in Miami-Dade County 
Public Schools (M-DCPS). We test whether there are spillover effects of the transfer, asking 
specifically whether the transfer teacher’s entry into the team affects the learning of students of 
teachers who have remained in that grade level before and after the new peer entered. 

We test four different potential types of spillovers. First, we look at the average spillover 
effects of new transfer teachers. This “linear-in-means” model assumes that with the arrival of an 
effective peer, all incumbent teachers will improve, and conversely, the arrival of an ineffective 
peer will hurt all others’ outcomes. We then consider the non-linearity of spillover effects 
depending on the difference in prior stable effectiveness between new transfers and incumbent 
teachers—the “relatively effective” and “relatively ineffective” models. The “relatively effective” 
approach models how incumbent teachers’ effectiveness changes in relationship to the degree to 
which the new peer is more effective than they are. This model could reflect knowledge transfer 


from more effective to less effective teachers. Similarly, the “relatively ineffective” approach 


measures the effect of the degree to which the new transfer is less effective. This model could 
capture a drain on incumbent teachers from having less effective teachers enter their grade. In 
contrast to the relativity approach, we lastly examine the variation of spillover effects depending 
on the absolute effectiveness of focal (incumbent) teachers. We use “focal” teachers 
interchangeably with “incumbent” teachers hereafter to refer to those who are already in the 
grade when the new transfer joins the team and whose students’ achievement gains are the 
outcome measures of the analysis. This “absolute effectiveness” model evaluates which types of 
teachers are more or less responsive to peers’ effectiveness. Less effective teachers may be more 
affected by the performance of new teachers, because they need greater support from their peers 
or are more easily influenced. 

Although we find some evidence of positive “linear-in-means” effects, we find stronger 
evidence of positive spillover effects associated with the introduction of relatively effective peers 
into a teacher group. If a student has a new peer teacher at the same grade level who is about one 
standard deviation more effective than that of his or her own teacher, this student would have a 
1.9 or 2.8 percent of a standard deviation increase in math test scores. This spillover effect is 
about 23% or 29% of the student’s own teacher’s effect on his or her achievement gains. We also 
find that effects are asymmetrical; although teachers benefit from a relatively effective peer, their 
students are not meaningfully disadvantaged by the presence of relatively ineffective peer. This 
finding implies the way of grouping teachers to maximize all students’ learning is to mix 
teachers with diverse performance. In keeping with the importance of relatively effective peers, 
we also find some evidence that low-performing teachers are more responsive to the composition 
of his or her peer colleagues than high-performing teachers. Having an effective peer teacher 


particularly benefits students assigned to low-performing teachers. 


In what follows, we first review the literature on spillover effects among employees in 
schools and other workplaces. Next, we describe the four types of spillover in more detail, 
motivated by possible spillover mechanisms among teachers. We then describe the data and 
analytic strategies for testing these models. Lastly, we discuss the main findings and their policy 
and research implications. 

Spillover Effects among Employees in Schools and Other Workplaces 

Research provides evidence of spillovers in both high-skilled and low-skilled workplaces. 
Workers’ wages are higher in firms with more educated coworkers or high-skilled workers 
(Battu, Belfield, and Sloane, 2003; Bauer & Vorell, 2010). Scientists have fewer grants and 
publications after a high-profile scientist leaves their institution. De Grip and Sauermann (2012) 
document the transmission of knowledge learned during a formal training program to other 
employees to justify the returns on a firm’s financing the cost of the general training programs. 
They emphasize that such spillover is magnified in settings where employees need to work in 
teams. Stoyanov and Zubanov (2012) discover that Danish manufacturing firms that hired 
workers from more productive firms experience productivity gains one year after the hiring. 
These gains increase with new hires’ education, tenure, and skill level, and are persistent for 
several years after the hiring. Kurada and Yamamoto (2013) find that when employees were 
transferred from Japan to European branches of the same global firms, these employees 
significantly reduce their work hours, resulting from behavioral influences of locally hired staff. 
The reduction in hours highly depends on the level of the interactions between the transfers and 
local peers. Mas and Moretti (2009) study peer effects among cashiers in supermarkets. Using 


high-frequency scanner data to develop a measure of worker productivity, they find strong 


evidence of positive productivity spillovers from the introduction of highly productive coworkers 
into a shift. 

Although studies have documented spillover effects and peer dynamics among employees 
in other industries, we know far less about peer effects among teachers in schools. Teaching is a 
high-skilled, socially meaningful, and knowledge-intensive profession. It involves collaborations 
with colleagues and learning on the job, making spillover particularly plausible (Cornelissen, 
Dustmann, & Schénberg, 2013). Using longitudinal elementary school teacher and student data, 
Jackson and Bruegmann (2009) find that students have larger test score gains when their teachers 
have more effective colleagues, with the historical peer quality (i.e., estimated value-added from 
an out-of-sample pre-period) for less experienced teachers explaining about 20 percent of 
students’ own-teacher effects. In their attempt to explain the mechanisms, available evidence 
suggests that peer learning may be the major avenue for the transmission of peer effects. 
Although this study provides compelling evidence on teachers’ spillover on student achievement, 
it does not directly measure the knowledge diffusion or peer learning mechanisms, or the 
potential heterogeneity of spillover under different peer compositions. 

Another strand of quantitative research studies uses teacher network data to provide more 
direct evidence on the diffusion of instructional expertise among teachers. One prior study 
(Author, 2013) identifies that the spillover effects of professional development programs through 
teacher collaboration can be as strong as the program direct effects on changing participating 
teachers’ classroom instruction. Teachers whose prior implementation of a new intervention 
were far from the desired practices responded more to direct participation in organized 
professional development, while teachers whose prior implementation were more advanced 


responded more to the sharing of promising practices and engaging in in-depth discussion with 


colleagues (Author, 2012). Besides suggesting peer effects, these findings also provide evidence 
on the heterogeneity of peer influences depending on the level of focal teachers’ prior teaching 
practices. However, this set of studies does not have student learning outcome measures in their 
analyses. Therefore, it is unclear if the changes in teachers’ self-reported instructional practices 
can later be transformed into changes in student learning outcomes. 

Not all studies of peers find positive effects. For example, a study of Teach For 
America’s (TFA) recent placement strategy of clustering large numbers of TFA corps members 
into targeted disadvantaged schools in the Miami-Dade County Public Schools (M-DCPS), finds 
null effect of spillover effects on the school-wide performance (Hansen, Backes, Brady, & Xu, 
2014). 

The mixed results of teacher spillovers on student test scores across studies may arise 
from different assumptions of spillover mechanisms, measures of peer effects, and the 
differences in school contexts in which teachers work. This complexity calls for a finer-grained 
analysis of teacher spillover effects by accounting for different types of spillover mechanisms 
among teachers, so that we can understand what ways of grouping teachers would generate 
positive spillover on student learning outcomes. 

Modeling Teacher Spillover Effects 

The mixed results of teacher peer effects in prior literature call for an in-depth 
investigation of the structure of peers and the heterogeneous effects of teacher peers. We 
hypothesize four types of peer influences, motivated by three potential mechanisms of spillover 
among teachers. These three mechanisms include joint production, pro-social motivation, and 


knowledge transfer (e.g., Author, 2014; Jackson & Bruegmann, 2009; Mas & Moretti, 2009). 


The most common approach to modeling peer effects is the “linear-in-means” model (e.g., 
Graham, 2008; Sacerdote, 2001; Summers & Wolfe, 1977), which suggests that an individual’s 
outcomes are a function of the average outcomes and characteristics of his or her peers. Ina 
teacher grade-level team, the linear-in-means approach implies that with the arrival of an 
effective peer, all incumbent teachers in the team will improve their outcomes, and conversely, 
the arrival of an ineffective peer will hurt all others’ outcomes (Hoxby & Weingarth, 2005). 
Although the linear-in-means model is often chosen because of its convenience and parsimony 
rather than because it is a convincing description of reality (Sacerdote, 2014), the context of 
teacher grade-level teams may make this model more realistic than in other cases. Particularly, 
the model is consistent with a mechanism of joint production in which various tasks that would 
promote student learning are distributed across teachers. For example, teachers may co-teach, co- 
plan, and share duties outside of their own classrooms (e.g., organizing math club). In many 
schools, teachers also work together to develop curriculum materials and analyze students’ 
assessment data (Author, 2015). The addition of an effective peer could increase the overall 
productivity of joint activities, while adding a worse peer could reduce this collective 
productivity. 

Peer effects can also be nonlinear, varying for different individuals (Carrell et al., 2009; 
Hoxby & Weingarth, 2005; Imberman, Kugler, & Sacerdote, 2012; Mas & Moretti, 2009). We 
examine whether peer teachers may have different effects on their grade-level team colleagues 
depending on their “relative effectiveness”. What may matter for whether there is spillover is 
how much more or less effective the new colleague is than the teacher already in the team. 

We consider first the case that a “relatively effective” new peer—that is, a teacher 


transferring into the grade-level team with higher teaching effectiveness than a focal teacher— 


could affect the achievement of the focal teacher’s students. Although the introduction of a 
relatively effective peer could worsen outcomes when focal teachers engage in “invidious 
comparisons” that undermine their confidence or sense of efficacy and, in turn, their effort level 
(Hoxby & Weingarth, 2005), several other potential mechanisms appear to make benefits from 
working with a more effective peer more likely. One such mechanism that is likely to be 
especially important in teacher grade-level teams is knowledge transfer or peer learning (Jackson 
& Bruegmann, 2009). Working with other teachers in teams or professional learning 
communities provides opportunities for information about effective instructional practices to be 
disseminated from one teacher to another (Author, 2013; Author, 2015). Working together in 
teams could allow teachers to share curricular materials, to discuss strategies for instruction or 
classroom management, or to model teaching practices for one another (Coburn & Russell, 2008). 
Each of these transfer mechanisms is likely to benefit less effective teachers given the 
opportunity to work with a relatively effective colleague. 

The presence of a relatively effective colleague may increase a less effective teacher’s 
motivation to work harder or seek out new strategies or techniques to increase his or her own 
effectiveness, through either friendly competition with the colleagues or being influenced by this 
colleague’s enthusiasm for teaching. Moreover, the sheer social pressure of not wanting to be 
perceived by their colleagues as less productive or uncooperative may motivate less effective 
teacher to improve when a relatively effective colleague is present (Mas & Moretti, 2009). 

Next, we consider the case that the entrance of a relatively ineffective new peer could 
affect a teacher’s students. While it is possible that the arrival of a less effective peer could 
increase a colleague’s performance by motivating this colleague to work harder to compensate 


for the lower productivity of the new peer, it seems more likely that this ineffective peer will 


have negative impacts. Knowledge transfer is asymmetric. Although there is always something 
that one teacher can learn from the other, knowledge typically flows from more knowledgeable 
or productive individuals to those who are less so (Author, 2013; Conley & Udry, 2010). An 
ineffective peer is thus less likely to provide the more effective colleague with productivity- 
enhancing insights, and may also impose costs on his or her more effective colleagues by taking 
up their time or attention in attempting to learn from them. At the same time, ineffective peers 
are less likely to bring significant benefits to their colleagues via prosocial pressure. Because the 
motivation process involves a teacher’s reflection of her own performance in reference to her 
peers, opportunities for prosocial pressure more likely happen when a teacher reflects own 
performance in comparison to more effective than less effective peers. 

Different from the “relative effectiveness” model that employs the comparison between 
the new peer and focal teacher, we lastly examine how peer effects vary depending on the 
“absolute effectiveness” of focal teachers. Although there are incentives for a less productive 
incumbent teacher to be a “‘free-rider” who eases the pace when a productive peer comes in (Mas 
& Moretti, 2009), we anticipate that many teachers in public schools are motivated to minimize 
productivity differentials with their more effective peers, because they often have a sense of 
pride for the profession that contributes to social greatness. Motivated by prosocial pressure and 
having more opportunities to receive knowledge as aforementioned, we anticipate the less 
effective incumbent teachers, on average, are more likely to accept the positive influence from 
the new peers. In contrast, effective incumbent teachers are less affected by their peers because 
they may be less motivated to turn to their peers for supports or fewer opportunities to receive 
constructive help. It is important to consider the variation of spillover effects along the absolute 


effectiveness of focal teachers when building teacher teams, because if students of low- 
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performing teachers, in general, disproportionately benefit more from the arrival of effective peer 
teachers, policy-makers and school leaders can leverage teacher peers to supplement the 
contribution of low-performing teachers to their students’ learning. 

Data and Sample 

Our data come from M-DCPS, the fourth-largest school district in the United States, and 
cover the school years from 2003-04 through 2012-13. We focus on math teachers in grades 3-8 
who can be linked to students for whom we have state standardized test scores in math. The data 
cover about 1.15 million student-year observations over the 10 years. Our analysis focuses on 
estimating the spillover effects on students’ math achievement for at least two reasons. First, 
teachers generally have a stronger effect on math achievement than reading, as identified in prior 
studies (e.g., Nye, Konstantopoulos, & Hedges, 2004) and also confirmed by our study in that 
the estimated effects of own teachers on reading test scores are only about one third to one half 
of the estimated effects of own teachers on math test scores. Second, one of our prior studies 
reveals a much stronger and consistent positive association of teachers’ instructional 
collaboration with their math value-added than reading value-added (Author, 2015, pp.28-30). 
More discussions on spillover on reading achievement are included in Appendix A. 

Table 1 describes the sample. Approximately nine percent of students are white; 25 
percent black; 65 percent Hispanic; 49 percent female; 13 percent with limited English 
proficiency; 72 percent eligible for subsidized lunch (FRPL); and 12 percent with special 
education needs. Besides conventional elementary and middle schools, M-DCPS has K-8 schools 
and combination schools (middle and high schools). Across these different school types, a math 
teacher in elementary grades (3-5) typically works with one group of students across multiple 


subjects, while a math teacher in secondary grades (6-8) typically works with multiple groups of 
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students within one subject area. Close to 60 percent of students are enrolled in elementary 
grades, with the rest in middle grades. We define a teacher’s primary grade level as the grade for 
which she teaches the largest number of students in a given year’. 

[TABLE 1 HERE] 

We measure teachers’ annual performance in raising students’ math test scores using 
“value-added” scores. Our preferred value-added model estimates teacher-by-year fixed effects, 
accounting for students’ test scores in math and reading in the prior year, demographics, English 
proficiency, and disability status, as well as the averages of these variables at both classroom and 
school levels (see Appendix B). This model adjusts teacher effect estimates for nonrandom 
assignment to students based on students’ time-varying and invariant characteristics and school 
contexts. A simulation study demonstrates that this model outperforms other popular value- 
added models and the student percentile growth model when nonrandom assignment of students 
exists (Guarino, Reckase, Stacy, & Wooldridge, 2015). To further confirm that our spillover 
estimates do not vary depending on value-added models, we construct alternative value-added 
models with either student or school fixed effects. These alternative models yield similar 
estimates of spillover effects to our preferred model*. After obtaining the teacher-by-year fixed 
effect estimates, we then shrink the estimates using the empirical Bayes methods to adjust for 
sampling and measurement errors and bring imprecise estimates closer to the mean (see Author, 
2012, for a description of the shrinking method). After shrinking the value-added estimates, we 
standardize them to have a mean of 0 and a standard deviation of | in each year to facilitate 
interpretation. 

We then average three lagged value-added measures to account for concerns about year- 


to-year fluctuation of value-added measures due to the variation in true teacher performance over 
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time and measurement error (Author, 2013)°. We name this aggregated measure as teachers’ 
prior stable effectiveness. This prior stable effectiveness has at least two advantages. First, the 
stable effectiveness prior to the peer shock of new transfers avoids the reflection problem in peer 
effect estimation, which we will explain further in next section (Manski, 1993). Second, this 
stable measure mitigates the spurious relationship between new transfers and incumbent teachers 
due to contemporaneous shocks to all teachers at a given point in time. 

There are 1,594 teacher-year transfer observations in the data that have stable teacher 
effectiveness measures over these ten years. As shown in Table 1, approximately 37 percent of 
these transfer teachers are white, 34 percent black, and 26 percent Hispanic. The total percentage 
of nonwhite transfer teachers is little over 63 percent, which is about eight percent higher than 
staying teachers. About 77 percent of transfer teachers are female, compared to 83 percent of 
incumbent teachers. The average transfer teachers’ working experience in this district is 7.6 
years, which is about 3.5 years junior than the average teaching experience in the district. 
Moreover, transfer teachers are, on average, less effective than incumbent teachers (-0.21 vs. - 
0.05), less likely to have advanced degree (master’s or higher, 43 percent vs. 45 percent), but 
have fewer days absent from work (5.8 vs. 6.6). 

Analytic Strategies 

We estimate spillover effects by leveraging the peer shock to incumbent teachers due to 
new teachers’ transferring into a teacher group. Our main analyses focus on grade-level peers, 
teachers who teach the same grade in the same school and year. This peer definition allows us to 
use different fixed effects in modeling spillover effects of new peer teachers on incumbent 
teachers’ student achievement. For example, we use (a) school-grade fixed effects to control for 


stable characteristics and practices in the given grade and school (e.g., stable peer effects among 
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continuing teachers in a given school and grade) (b) school-year fixed effects to control for other 
possible school-year shocks than new peers’ entry (e.g., enhanced professional development or 
teacher collaboration in the school in a given year), (c) year fixed effects to control year-to-year 
variations in district policies that may influence teacher collaboration and student achievement, 
and (d) grade fixed effects for grade-level differences that could affect both student achievement 
and teacher transfer behaviors. 

Peer effects may expand beyond grade-level peers. This expansion may be particularly 
likely in schools with strong teacher collaborative activities. However, school-level peer 
estimates are subject to other yearly shocks to the schools that may coincide with the arrival of 
new peers and cannot be easily addressed using our data. We thus focus on grade-level peer 
effects. To demonstrate the possible spillover beyond the grade level, we show school-level 
estimates in Appendix C with a caveat of weaker identification strategies. 

There are three key challenges for identifying peer effects in literature: Common 
influences, reflection, and selection, all of which can lead to bias in the estimate of peer effects. 
If we were to use peer characteristics that were contemporaneous with the focal teacher effect, 
we would worry about common influences, for example, students having an illness at the time of 
the test or teachers’ co-participation in professional development programs. These common 
influences would affect the performance of both new transfers and incumbent teachers, and 
appear to be a peer effect. However, since we measure the peer characteristic—value-added— 
prior to when the peer and focal teacher interact, these common influence problems should not 
bias our peer effect estimates. The reflection problem is similar. It refers to the scenario when 


one individual’s outcome is influenced by others in a given period of time, and influences others 
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in the same period (Manski, 1993). Because we use the peer teacher value-added prior to when 
he or she met the focal teacher, reflection is not a problem in our case. 

The final potential source of bias—selection— is more difficult to address. Selection may 
bias the peer estimates in settings where peers self-select into peer groups in a manner that is 
unobserved to researchers. For example, new transfers may select schools with similar peers, or 
principals may assign new transfers to peer groups with similar productivity. This selection could 
cause substantial upward bias in the estimated magnitude of peer effects (Sacerdote, 2011). By 
controlling for incumbent teachers’ prior stable effectiveness and by comparing grades within 
schools within a given year, we adjust for much of this selection. We conduct falsification tests 
to examine other potential mechanisms of teacher selection in a later section in the paper, 
confirming that any resulting biases have little impact on our estimates. Moreover, we use (a) 
school-grade fixed effects to account for the time-invariant attractiveness of a particular grade in 
a school, (b) school-year fixed effects to account for time-varying attractiveness of a particular 
school in a given year, and the year-to-year variation in vacancies in a given school due to 
teacher turnover or the increase in student enrollment, and (c) time-varying lagged student 
achievement scores to account for the possibility of transfer teachers using such information to 
make their selection. 

“Linear-in-means”’: To construct the “linear-in-means” model, we model student math 
test score as a function of his or her teachers’ prior effectiveness and the average prior stable 
effectiveness of new peer teachers. In particular, we model: 

other 


Aijjgst= ora Ajjgst-1+ Q2 ijgst-1* yiXijgsrt y2Cijest 


ah Bi*Ojest-1,2,3+ B2* Okgst-1,2,3 ae SGes + Tr + Sijgst (1) 
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where Ajjgs: is the math exam score of student i, taught by incumbent teacher j in grade g, school 
s, and year t. This variable does not include new transfers’ own students’ scores on the left-hand 
side of the equation but only includes students’ test scores of incumbent teachers, so that we can 
better attribute the gain in test scores to peer effects, rather than to own teachers’ contribution to 


student achievement gains. Ajjgs--7 indicates this student’s prior year math test score and 


other 
AGasts 


,indicates his or her prior year score in the other subject (e.g., reading). 
Xigst is a vector of student 7’s characteristics, including poverty status, whether the 
student is an English language learner, the student’s race, gender, age, prior suspension, and 
prior absence. Cijgsris a vector of student 7’s classmates’ characteristics, such as percent of 
students eligible for subsidized lunch, percent of students who are English language learners, 
percent of Hispanic, African American, Asian, and white student, percent of female students, 
average age, average days of suspension, average prior scores in math and reading, and average 
days absent. 

Ojest-1,2,3 18 Student i’s own teacher /’s value-added scores averaged over prior three years 
(t-1, t-2, and t-3)—the focal(incumbent) teachers’ prior stable effectiveness. Okgsz-1,2,3 18 the 
newcomer k’s value-added scores averaged over three years prior to transferring into this school 
(t-1, ¢-2, and ¢-3); and B2 captures the “linear-in-means” estimate. SGgs is the school-grade fixed 
effects, and 7 is the year fixed effects. We also estimate the equation (1) with the combination 
of school-year, and grade fixed effects. The standard errors are clustered at the school-grade-year 
level. Eijgsr is the error term. 
“Relative effectiveness”: We then examine how the peer effects vary depending on the 


difference in effectiveness between the transfer and focal teacher —student i’s own incumbent 


teacher 7. We define “relative effectiveness” as how much more effective the new transfer k was 
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over the preceding three years than the focal teacher. “Relative ineffectiveness” is then defined 
as how much less effective the new transfer 4 was than the focal teacher. We estimate the 
effects of these two types of peers separately because we suspect differential effects of 
“relatively effective” and “relatively ineffective” peers. 
Relative effectiveness &, jgst-1,2,3= D* (Okgst-1,2,3- Ojgst-1,2,3 ) 
Relative ineffectiveness k, jgst-1,2,3= (1-D)*(Okgst-1,2,3- Ojgst-1,2,3) 
where D=1 if (Oxgst-1,2,3- Ojesr-1,2,3)>0; D=0 if (Oxest-1,2,3- Ojgst-1,2,3)<0. 
We estimate the effects of “relatively effective” and “relatively ineffective” peers using equation 
(2). 
Aijgst= ota Aijgst-1+ O2Aijgst-1 reading HY 1 Xijgst+ YoCijgst + Bi* Ojese-1,2,3 
+B2* Relative Effectiveness k, jgsi-1,2,3 + B3* Relative Ineffectiveness k, jgst-1,2,3 
+ SGgs + m1 + Eitgst (2) 
“Absolute effectiveness’’: We then test for heterogeneous effects depending on the prior 
stable effectiveness of incumbent teachers using an interaction term between incumbent teacher 


JS prior stable effectiveness and new peer k’s prior stable effectiveness. Equation (3) illustrates 


the estimation model. 


Aijjgst= ora Ajjgst-1+ a2A ijest-1 reading Yi Mijgstt y2Cijest +f Bi* Ojest-1,2,3 + B2* Okgst-1,2,3 
a B3*Ojest-1,2,3 Xx Oxgst-1,2,3 + SGe¢s + Tr + Sijgst (3) 
where 3 indicates the amount of additional gain in student 7’s test score that can be attributed to 


a new transfer teacher k, with one standard-deviation increase in own teacher j’s prior stable 


effectiveness. This interaction effect identifies what types of teachers more or less benefit from a 


high-performing peer. 
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We conduct robustness and falsification tests to examine how plausible teacher selection 
and student sorting may bias the main estimates of grade-level spillover effects. These tests are 
detailed in the next section. 

Results 
Grade-level Spillover Patterns 

For each type of spillover models, we present figures that graphically illustrate the 
patterns and regression estimates that formalize these patterns. Figure 1 shows a positive 
association of the new peers’ prior stable effectiveness with focal teachers’ student math 
achievement. This pattern is formalized in the linear-in means estimates in Table 2. The average 
effects of a one standard deviation change in the prior stable effectiveness of the new transfer 
teacher on the achievement gains of students taught by incumbent teachers in the same grade, are 
between one percent and two percent of a standard deviation of students’ math test scores. They 
are positive and mostly statistically significant at either the 0.10 or 0.05 level. These linear-in- 
means effects are between 15 percent and 29 percent of the effects of own teachers’ effects 
(0.01/0.068 or 0.020/0.069). These percentages are consistent with Jackson and Bruegmann’s 
(JB, hereafter) estimates of between 10 and 20 percent of the own teacher effect (Jackson & 
Bruegmann, 2009, p.99). 

[FIGURE 1 HERE] 
[TABLE 2 HERE] 

The size of our linear-in-means effects is somewhat smaller than JB’s estimate of 
approximately four percent of a standard deviation of math test scores, likely stemming from 
some differences in the types of peer spillovers estimated as well as the measures of teacher 


effectiveness. First, the peer measure in our study averages the effectiveness of only transfer 
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teachers and thus captures the effects from new transfer teachers to other teachers at the grade 
level, while JB’s study averages all grade-level peers and captures the peers among all grade- 
level teachers. Second, our study leverages the one-year co-working experience among teachers, 
while the JB’s peer effects may reflect co-working experience in multiple years. Third, notably 
the linear-in-means effects are estimated using teachers’ prior stable effectiveness, which results 
in the smaller point estimate than current year teacher effects. This is in similar vein of the 
estimate of own-teacher effect of 0.07 in Table 2, which is smaller than JB’s estimates of own 
teacher effect estimates in math— approximately 0.13 standard deviations. 

Figure 2 illustrates the “relative effectiveness” model. The linear fit line for cases with 
the x-axis< 0 is close to flat, which shows a very weak relationship between “relatively 
ineffective” peers and focal teachers’ student math achievement. In contrast, the linear fit line for 
cases with the x-axis>0 has a steeper, more positive slope, which indicates a much stronger 
positive association between “relatively effective” peers and focal teachers’ student achievement 
Table 2 provides estimates that formalize the differential effects of having peers who are more or 
less effective than the focal (incumbent) teacher. If a student in the class of an incumbent 
teachers has a new transfer teacher at the same grade level who is one standard deviation higher 
in prior stable effectiveness than that of their own teacher, this student would have a 1.9 or 2.8 
percent of a standard deviation increase in math test scores. This spillover effect is about 23 or 
29 percent of the student’s own teacher effect (0.019/0.081 or 0.028/0.095). Surprisingly, if the 
transfer peer teacher is about one standard deviation lower than that of own teacher, this student 
would not be meaningfully affected by the new teacher. The “relatively ineffective” estimate is 


very close to zero and not statistically different from zero. An F-test shows that the “relatively 
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effective” estimate is significantly different from the “relatively ineffective” estimate (F=6.88, 
p< 0.001). 
[FIGURE 2 HERE] 

Finally, Table 2 gives the variation of spillover effects by the absolute effectiveness of 
incumbent teachers, as indicated by the interaction term between new transfers’ prior stable 
effectiveness and own teachers’ prior stable effectiveness. It measures whether more or less 
effective incumbent teachers are differentially affected by transferring teachers. The significantly 
negative coefficients provide evidence that with one standard deviation increase in own teachers’ 
prior effectiveness, the spillover effect from new transfer peers would decrease about 0.6 percent 
or 0.8 percent of one standard deviation of student test scores. In other words, new peer teachers 
matter less for students whose own teachers were relatively more effective, or equivalently, that 
they matter more for those students whose own teachers were less effective. 

Figure 3 bases on equation (3) and plots the marginal effects of new peers on focal 
teachers’ student achievement (i.e., the predicted spillover effects) against focal teachers’ prior 
stable effectiveness. The figure confirms a substantial heterogeneity in how teachers respond to 
peers: the spillover is positive and larger for low-performing teachers, and has little effect on the 
high-performing ones. Notably, the estimated effects are negative for just a very small number of 
cases and their 95% confidence intervals all include zero, suggesting that the effectiveness of 
high-performing teachers is, on average, not particularly hurt by the presence of low-performing 
peers. 

[FIGURE 3 HERE] 
Overall, the main findings show that teachers who are newly transferred to a grade affect 


the learning of students of incumbent teachers. These effects are bigger when the new teacher is 
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more effective than the incumbent teacher, while the new teacher who is less effective has little 
impact on students’ learning in the incumbent teacher’s classrooms. The positive spillover 
effects are also bigger for less effective incumbent teachers. 
Robustness and Falsification Tests 
Teacher Sorting 

Other possible shocks to the composition of grade-level peers could affect student 
learning and bias our estimates of peer spillover. First, it is possible that a novice teacher who 
just started her/his career was employed at the same time in the same grade and school as a new 
transfer entered the team. Equations (1), (2), and (3) would drop this novice teacher and her 
students from the analysis, because a novice teacher did not have prior value-added scores on the 
right-hand side of the equations. For the same reason, the new transfer teachers without prior 
stable effectiveness, although being part of the new members of the teaching team, would be 
dropped out of the analysis too. These other new peers, including both novice teachers and new 
transfers without prior stable effectiveness, could be the omitted factor that confounds the grade- 
level phenomenon of benefiting incumbent teachers, if the entrance of these other new teachers 
is correlated with the prior performance of new transfer teachers who had prior stable 
effectiveness. To account for the influence of other new teachers, we create a continuous variable 
indicating the number of other new teachers at the same grade and add that to Equations (1)(2) 
and (3). Table 3 reports the results in the columns of “w/Other New Teachers.” The point 
estimates and standard errors of “linear-in-means”, “relatively effective/ineffective”, and 
“absolute effectiveness” effects are quite consistent with corresponding estimates the column of 


“Main Model”, as are the adjusted R-squared values.* 
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Teachers churn within schools with some teachers moving to a new grade that they did 
not teach in the year before (entry) and others moving out (exit) (Author, 2014). These churning 
teachers could affect students in much the same way as novice teachers do. To account for entry 
to a given grade, we add a dummy variable to indicate if the incumbent teachers were in a 
different grade from last year. Results are shown in the column of “w/Other New Teachers and 
Grade-churning” in Table 3. Spillover effects again do not change in any meaningful ways from 
those estimates in column of “Main Model”. 

[TABLE 3 HERE] 

A third possible shock to a given grade in a given year that could bias our estimates is 
teachers’ exit. The main concern is that an ineffective teacher’s moving out of a grade and 
school in year ¢ is followed by an effective new peer transferring in year +1. The increase in 
student achievement might stem not from the spillover of new effective transfer in year t+1 to 
incumbent teachers, but rather because of the removal of an ineffective teacher from this grade 
and school in year ¢. If there is a systematic pattern that departed teachers were, on average, less 
effective than stayed teachers in year ¢, and a new transfer was more effective than incumbent 
teachers in year t+1, the significant positive effect of “relatively effective” peers could be 
invalidated. To address this concern, we regress the difference in prior effectiveness between 
new transfers and staying teachers in year ¢+1 on the difference in prior effectiveness between 
departed teachers and staying teachers in year ¢. The point estimate is small and not significantly 
different from zero (£ = -0.005, s.e. = 0.089; p=0.959). Thus, we find no evidence that the arrival 
of a more effective new teacher to a particular grade in a school is related to the performance 
difference between departed and staying teachers in the prior year. 


Student Sorting and Other Grade-Specific Interventions 
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Besides teacher sorting, it is possible that our results are confounded by dynamic student 
sorting (or tracking) or other related grade-level interventions that cannot be fully controlled by 
lagged test scores, individual characteristics, and classmates’ characteristics. For example, the 
“relatively effective” estimate can reflect that incumbent teachers get better students and also 
lobby for better new peers. This particular sorting would be problematic. We conduct a 
falsification test by regressing a student’s test score in year ¢-1 on his/her future teacher’s value- 
added in year f, controlling for this student’s teacher effect in year ¢-1, characteristics of this 
student and his/her classmates’ characteristics, and school-grade, year fixed effects or school- 
year, grade fixed effects. If there was troubling unobservable student sorting, the coefficient of 
future teacher should be statistically significant. Although the coefficients on current teacher 
effect are about 0.11 and significant at the 1 percent level, the coefficient on future teacher’s 
effect is only 0.002 with p-value greater than 0.8. Next, to assess whether incumbent teachers 
lobby for better new peers, we regress incumbent teachers’ prior stable performance on transfer 
teachers’ prior stable performance, controlling for school fixed effects. The small coefficient of - 
0.016 is far from statistical significance (p=0.307).° Moreover, this point estimate suggests a 
negative, rather than positive relationship, which does not support the possibility that effective 
incumbent teachers lobby for effective new peers. 

Another possibility is that a principal might assign an effective new transfer to a poor 
performing grade as part of his grade-specific improvement, while this principal might 
implement other interventions at the same time (Jackson & Bruegmann, 2009). To test whether 
other grade-level specific intervention would invalidate the inference of peer spillover, we 
regress grade-level average test scores of students of incumbent teachers in year ¢-/ on the 


effectiveness of future new transfers in year ¢. The coefficients of future new transfers are -0.012 
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and far from statistical significance (p > 0.8). Taking these falsification tests together, there is 
little evidence on assigning new transfers as part of student sorting, lobbying for better new peers, 
or grade-specific interventions on student achievement. 

Other Endogeneity Problems Associated with Voluntary Teacher Transfer 

One might be still concerned about the spurious relationships between teacher self- 
selection into the school and student achievement, which cannot be fully accounted for using 
school-grade and school-year fixed effects, and lagged test scores. To further circumvent the 
problem of teacher self-selection, we leverage a unique involuntary transfer policy utilized by M- 
DCPS over a three-year period. 

In the 2010, 2011, and 2012 school years, M-CDPS exercised a clause in its Collective 
Bargaining Agreement (CBA) allowing for the transfer of teachers involuntarily within the 
district (Author, 2014). In the summer prior to each of school years, principals provided regional 
administrators with the lists of teachers they wanted to transfer out of their schools, which were 
then forwarded to the Instructional Staffing division in the district central office, who sought a 
new placement for each teacher. The placement takes into account staffing needs of the receiving 
schools, and, in some cases, input from regional administrators, but no input from transferred 
teachers themselves. In each year, transferred teachers were notified of the transfers and the new 
placements at the very end of the summer—in many cases not until the week before the start of 
school. There was no time for transferred teachers to appeal, so almost all teachers complied 
with their new placements. The average effectiveness of involuntary transfers can be considered 
plausibly exogenous. 

Among 153 elementary and middle school teachers who were moved involuntarily over 


these three years, there were 46 math teachers with value-added. We apply Equation (1), (2), and 


24 


(3) to this subsample of teachers. Results are included in Table 4 and show similar patterns of 
spillover effect as in our main sample that includes both voluntary and involuntary transfers. 
While the point estimates are not statistically significant, the small sample of involuntary 
transfers leads to larger standard error estimates and the loss of precision of the estimation. 
However, the magnitude and direction of the estimated spillover effects are very similar to the 
main findings in Table 2. Namely, the “linear-in-means” estimate is 0.015, the “relatively 
effective” estimate is 0.031, the “relatively ineffective” estimate is -0.009, and the “absolute 
effectiveness” estimate is -0.014, in comparison to the estimates in Table 2— 0.02, 0.028, -0.009, 
and -0.008, respectively. 
[TABLE 4 HERE] 

Spillover Effects in Elementary and Middle Grades Separately 

Elementary and middle grades have different organizational structures that may influence 
peer formation and influence among teachers. First, elementary teachers are often assigned to 
work with a particular group of students, while teachers in middle grades are often responsible 
for multiple groups of students. Second, collaboration among elementary teachers is more 
common within grades, while collaboration among secondary teachers is more common across 
grade levels, but within subject areas. These differences in the sharing of students and 
collaboration structure between elementary and middle grades may suggest differential spillover 
effects in elementary and middle grades. 

We estimate the spillover effects separately by elementary and middle grades and provide 
the results in Table 5. Most of the results are quite similar across grade levels and to the pooled 
estimates (i.e., the main effects); but there is more variation across models when examined 


separately by school levels. In particular, for elementary grades, the Model 1 with school-grade 
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and year fixed effects gives estimates that are very similar to the pooled estimates, while the 
Model 2 with the school-year and grade fixed effects gives estimates that are generally lower in 
magnitude, with the exception of the “relatively ineffective” and “absolute effectiveness” 
estimates. For the middle grades, on the other hand, the school-year and grade fixed effects 
estimates tend to be larger in magnitude than those with school-grade and year fixed effects. The 
reason for these differences may be that elementary grades typically have fewer new transfers in 
a given year and thus less variation in transfer teachers’ effectiveness within a school and year. 
Therefore, the estimates from the school-year grade model that relies on the variation within a 
school year consistently generate smaller estimates than the school-grade year model that relies 
on the variation over time. In contrast, middle school grades may have more teachers transferred 
in a given school and year, and therefore, the Model 2 that uses the variation in transferred 
teachers’ effectiveness within a school and year consistently generate larger estimates. 
[TABLE 5 HERE] 

Discussion and Conclusion 

Teaching is often considered as an isolated practice with little interactions among 
teachers (e.g., Lortie, 1975). Yet, recent reforms have worked to increase teacher collaboration, 
and some recent work has demonstrated effects of teachers on each other (e.g., Author, 2013; 
Jackson & Bruegmann, 2009). Our study looks at the effects of new transfer teachers to grade- 
level teams, asking specifically whether having a more effective teacher entering the grade 
improves the learning of students of teachers who have remained in that grade level before and 
after the new teacher entered. Overall, we find strong and consistent evidence of positive 


spillover effects, as more effective teachers boost students of other teachers in the grade. 


26 


In particular, if a student has a new peer teacher at the same grade level who is about one 
standard deviation more effective than that of own teacher, this student would have a 1.9 or 2.8 
percent of a standard deviation increase in math test scores. This spillover effect is about 23% to 
29% of this student’s own teacher effect. This positive spillover effect from relatively effective 
peers is robust to (a) using school-grade fixed effects to account for the time-invariant 
attractiveness of a particular school and grade, or (b) using school-year fixed effects to account 
for time-varying attractiveness of a particular school and yearly variations in school conditions. 
We also present a variety of falsification tests to show that the results are probably not biased by 
nonrandom student sorting and by endogenous teachers’ movement across grades and schools. 

Although relatively effective peers have positive spillovers, students of incumbent 
teachers are not particularly disadvantaged by the presence of relatively ineffective peers. 
Moreover, low-performing teachers seem more responsive to the composition of peers than high- 
performing teachers. With one standard deviation decrease in own teachers’ prior effectiveness, 
the spillover effect from new transfers would increase about 0.6 percent or 0.8 percent of one 
standard deviation of student test scores. These findings are important, because they imply that 
strategic grouping of teachers to potentially maximize all students’ learning in aggregate is to 
pair ineffective teachers with more effective colleagues. 

Although this study could not provide direct evidence on spillover mechanisms, the 
significant “relatively effective” estimates support the explanations of both knowledge spillover 
from effective teachers to less effective colleagues, and motivations from peer pressures and 
prosocial preferences. Moreover, the findings on “linear-in-means” model somewhat support the 
joint production explanation in that with one standard deviation increase in the new peer’s 


effectiveness, the teacher team will increase their average productivity by about 1 percent or 2 
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percent of one standard deviation of students’ math test scores, although the significance of this 
effect varies across model specifications. 

Future studies can continue to examine what drives spillover, so that policymakers can 
better design teacher incentive and development programs or manipulating teacher assignments 
to magnify effective teachers’ contribution to the whole teacher team’s performance. For 
example, if knowledge spillover is the predominant mechanism, school leaders can organize 
effective professional development programs and professional learning communities within 
schools to facilitate the diffusion of instructional expertise (e.g., Author, 2013). If motivation is 
the primary mechanism, strategies of making effective teachers visible to their colleagues, such 
as recognizing effective teachers through differential pay or career ladder system, could be useful. 
Despite the difficulties in re-arranging teachers and manipulating peer effects, we hope this study 
will promote the discussions among researchers and school leaders about the optimal grouping of 
teachers within and across schools to achieve larger gains in student achievement, which goes 


above and beyond effective teachers’ impacts in their own classrooms. 
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Notes 

1. Ifa teacher had multiple grades with the same number of students, we use the lowest grade. 
This applies to 1.7 percent of teachers in our sample—a very small fraction of our sample. 

2. The results are available from the authors upon request. 

3. This aggregated measure includes teachers who have three lagged value-added measures 
(about 21%), and those who only have two (27%) or one lagged value-added (52%). To examine 
how our estimates of spillover effects vary depending on the number of lagged value-added, we 
use either the most recent lagged value-added and the most recent two lagged value-added in the 
estimation. Our results, which are available upon request, do not differ in any meaningful way 
from those presented in the result section. 

4. We also try specifications that include an indicator for having any of other new teachers 
instead of the number of other new teachers and find very similar results. Detailed results are 
available upon request. 

5. This coefficient is relatively consistent across different model specifications with different 
fixed effects. For example, the school-grade year combination yields an estimate of -.017 
(s.e.=0.02, p=0.84), and the school-year grade combination yields an estimate of -0.048 


(s.c.=0.03, p=1.58). 


29 


References 


Aaronson, D., Barrow, L., & Sander, W. (2007). Teachers and student achievement in the 
Chicago public high schools. Journal of Labor Economics, 25(1), 95-135. 


Author. (2012). 
Author. (2013). 
Author. (2014). 
Author. (2015). 


Battu, H., Belfield, C.R., & Sloane, P.J. (2003). Human capital spillovers within the workplace: 
Evidence from Great Britain. Oxford Bulletin of Economics and Statistics, 65(5), 575-594. 


Bauer, T. K., & Vorell, M. (2010). External effects of education: Human capital spillovers in 
regions and firms (RUHR #195). Bochum, Germany: Ruhr-Universitat Bochum (RUB). 


Carrell, S. E., Fullerton R.L., & West, J. E. (2009). Does your cohort matter? Measuring peer 
effects in college achievement. Journal of Labor Economics, 27(3), 439-464. 


Coburn, C. E., & Russell, J. L. (2008). District policy and teachers’ social networks. Educational 
Evaluation and Policy Analysis, 30 (3), 203-235. 


Conley, T. G., & Udry, C. R. (2010). Learning about a new technology: Pineapple in 
Ghana. American Economic Review, 100(1), 35-69. 


Cornelissen, T., Dustmann, C., & Schénberg, U. (2013). Peer effects in the workplace (IZA 
Discussion Paper No. 7617). Retrieved from http://ftp.iza.org/dp7617.pdf 


De Grip, A., & Sauermann, J. (2012). The effects of training on own and co-worker productivity: 
Evidence from a field experiment. The Economic Journal, 122(560), 376-399. 


Glazerman, S., Protik, A., Teh, B., Bruch, J., & Max, J. (2013). Transfer incentives for High- 
performing Teachers: Final results from a multisite experiment (NCEE 2014-4003). 
Washington, DC: National Center for Education Evaluation and Regional Assistance, 
Institute of Education Sciences, U.S. Department of Education. 


Graham B. S. (2008). Identifying social interactions through conditional variance restrictions. 
Econometrica, 76, 643-660. 


Guarino, C., Reckase, M., Stacy, B., & Wooldridge, J.(2015). A comparison of student growth 
percentile and value-added models of teacher performance. Statistics and Public Policy. 
2(1), retrieved at http://dx.doi.org/10.1080/2330443X.2015.1034820. 


Hansen, M., Backes, B., Brady, V., & Xu, Z. (2014). Examining spillover effects from Teach for 
America corps members in Miami-Dade County Public Schools. (CALDER Working 
Paper No.113). Washington, D.C.: National Center for Analysis of Longitudinal Data in 
Education Research, American Institute for Research. 


Hoxby, C. M., & Weingarth, G. (2005). Taking race out of the equation: School reassignment 
and the structure of peer effects. retrieved from 


https://www.aeaweb.org/assa/2006/0108 1300 _0803.pdf. 


30 


Imberman, S. A., Kugler, A. D., & Sacerdote, B. I. (2012). Katrina’s children: Evidence on the 
structure of peer effects from hurricane evacuees. American Economic Review, 102(5): 
2048-2082. 


Jackson, C.K., & Bruegmann, E. (2009). Teaching students and teaching each other: The 
importance of peer learning for teachers. American Economic Journal: Applied 
Economics, 1(4), 85—108. 


Koretz, D. M. (2002). Limitations in the use of achievement tests as measures of educators’ 
productivity. Journal of Human Resources, 37, 752-777. 


Kuroda, S., & Yamamoto, I. (2013). Do peers affect determination of work hours? Evidence 
based on unique employee data from global Japanese firms in Europe. Journal of Labor 
Research, 34(3), 359-388 


Lortie, D.C. (1975). Schoolteacher: A sociological study. Chicago, IL: University Of Chicago 
Press 


Manski, C.F. (1993). Identification of endogenous social effects: The reflection problem. The 
Review of Economic Studies, 60(3), 531-542. 


Mas, A., & Moretti, E. (2009). Peers at work. American Economic Review, 99(1): 112-45. 


McCaffrey, D. F., Lockwood, J. R., Kortez, D. M., Louis, T. A., & Hamilton, L. S. (2004). 
Models for value-added modeling of teacher effects. Journal of Educational and 
Behavioral Statistics, 29, 67-102 


McCaffrey, D.F., Sass, T. R., Lockwood, J.R., & Mihaly, K. (2009). The intertemporal 
variability of teacher effect estimates. Education Finance and Policy, 4(4), 572-606. 


Nye, B., Konstantopoulos, S., & Hedges, L.V. (2004). How large are teacher effects? 
Educational Evaluation and Policy Analysis, 26(3), 237-257. 


Rivkin, S. G., Hanushek, E. A., & Kain, J.F. (2005). Teachers, schools, and academic 
achievement. Econometrica, 73(2), 417-58. 


Rockoff, J. E. (2004). The impact of individual teachers on student achievement: Evidence from 
panel data. American Economic Review, 94(2), 247-52. 


Sacerdote, B. (2001). Peer effects with random assignment: Results for Dartmouth roommates. 
The Quarterly Journal of Economics, 116(2). 681-704. 


Sacerdote, B. (2011). Peer effects in education: How might they work, how big are they and how 
much do we know thus far? Handbook of the Economics of Education, 3, 249-277. 


Sacerdote, B. (2014). Experimental and quasi-experimental analysis of peer effects: Two steps 
forward? Annual Review of Economics, 6(1), 253-272. 


Sanders, W. L., & Rivers, J.C. (1996). Cumulative and residual effects of teachers on future 
student academic achievement. Research Progress Report. Knoxville: University of 
Tennessee Value-Added Research and Assessment Center. 


Stoyanov, A., & Zubanov. N. (2012). Productivity spillovers across firms through worker 
mobility. American Economic Journal: Applied Economics, 4(2), 168-98. 


31 


Summers, A.A., & Wolfe, B. L. (1977). Do schools make a difference? American Economic 
Review, 6(4), 639-652. 


Xu Z., Ozek, U., & Corritore, M. (2012). Portability of teacher effectiveness across school 
settings (CALDER Working Paper No.77). Washington, D.C.: National Center for 
Analysis of Longitudinal Data in Education Research, American Institute for Research. 


32 


Figures and Tables 


Figure 1. “Linear-in-means”, the association of new peer teachers’ prior stable effectiveness 
with the student achievement of focal (incumbent) teachers 
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Note. This figure plots the linear relationship between new peer teachers’ prior stable effectiveness and the average 
student achievement of focal (incumbent) teachers, after controlling for focal teachers’ own prior stable 
effectiveness, based on Equation (1). 
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Figure 2. “Relative effectiveness,” comparing the slope of “relative effective” peers in 
predicting the student achievement of focal (incumbent) teachers with that of the “relatively 
ineffective” peers 
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Note. This figure bases on Equation (2). The x-axis of “relative effectiveness” is the difference in prior stable 
effectiveness between new peers and the incumbent teacher himself or herself. The linear fit line for cases with the 
x-axis< 0 is close to flat, which shows a very weak relationship between “relatively ineffective” peers and focal 
teachers’ student math achievement. In contrast, the linear fit line for cases with the x-axis>0 has a steeper, positive 
slope, which indicates a much stronger positive association between “relatively effective” peers and focal teachers’ 
student achievement. 
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Figure 3. “Absolute effectiveness,” the relationship between focal-teacher-specific spillover 
effects and their own prior stable effectiveness 
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Note. This figure bases on Equation (3) and plots the marginal effects of new peers on focal teachers’ student 
achievement against focal teachers’ prior stable effectiveness. This figure confirms that the spillover is larger for 
less effective teachers and smaller for more effective teachers. Notably, the spillover effect estimates are negative 
for a small number of cases and their 95% of the confidence intervals all include zero, indicating that effective focal 
teachers are not significantly disadvantaged. 
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Table 1. Descriptive statistics for M-DCPS students and teachers 


Variables Mean 
Student-year observations 
Math scores 0.015 
Race/ethnicity: White 0.085 
Black 0.247 
Hispanic 0.647 
Other 0.021 
Female 0.486 
English language learner 0.125 
Eligible for free- and reduced-price lunch (FRL) 0.722 
Special education 0.117 
N 1,150,468 
Transfer teacher-year observations 
Female 0.770 
Race/ethnicity: White 0.368 
Black 0.344 
Hispanic 0.267 
Other 0.021 
Advanced degree (Master’s or higher) 0.432 
Standardized value-added -0.205 
Teaching experience in this school district 7.629 
Total days of absence 5.825 
N 1,594 
Incumbent teacher- year observation 
Female 0.832 
Race/ethnicity: White 0.456 
Black 0.296 
Hispanic 0.237 
Other 0.015 
Advanced degree (Master’s or higher) 0.453 
Standardized value-added -0.046 
Teaching experience in this school district 11.169 
Total days of absence 6.581 
N 26,346 


SD 


0.975 
0.280 
0.431 
0.478 
0.142 
0.50 
0.330 
0.448 
0.321 


0.421 
0.482 
0.475 
0.443 
0.142 
0.495 
0.947 
6.614 
5.613 


0.374 
0.498 
0.456 
0.423 
0.121 
0.498 
0.937 
8.515 
6.194 
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Table 2. Estimated grade-level spillover effects 


Model-1 Model-2 Model-1 Model-2. Model-1 Model-2 


“Linear-in-means”:Prior stable — 0.010° 0.020* 
effectiveness of new transfer (0.006) (0.009) 0.010 0.021* 
peers , (0.006) (0.009) 


“Relatively effective”: Positive 
values of the difference in 
prior stable effectiveness 
between new transfer peers 
and focal teachers 

“Relatively ineffective”: 
Negative values of the 
difference in prior 
effectiveness between new 
transfers and focal teachers 

F-statistics for the difference 
between “relatively 
effective” and “relatively 
ineffective” 

“Absolute effectiveness”: Prior 
stable effectiveness of new 
transfer peers < Prior stable 
effectiveness of focal 


0.019*** — 0,028*#* 
(0.005) (0.008) 


0.002 -0.009 
(0.006) (0.008) 


3.771 6.88** 


-0.006  -0.008* 
(0.004) (0.004) 


teachers 
Students’ own (focal) teachers’ 0.068*** 0.069***  0.081***  0.095*** 0.069*** 0.069*** 
prior effectiveness (0.006) (0.006) (0.008) (0.011) (0.005) (0.005) 
N 109,422 109,422 109,422 109,422 109,422 109,422 


adj. R-sq 0.687 0.690 0.687 0.690 0.687 0.690 


Note: data from 2003-04 to 2012-13. 

All models include student and classroom covariates. Student covariates include eligibility for free or reduced price 
lunch (FRL), whether the student is an English language learner, the student’s race/ethnicity, gender, age, prior 
suspension, prior absence, and prior math and reading test scores. Classroom covariates include % of FRL, % of 
students are English language learners, % of Hispanic, % of African American, % of Asian, % of White, % of 
female, average age, average days of prior suspension, average days of prior absence, and the average and standard 
deviation of students’ prior math and reading test scores. 

Model-1 includes school-grade, year fixed effects; Model-2 includes school-year, grade fixed effects. 

Standard errors are included in the parentheses and clustered at the school-grade-year level. 


¥ pp <0.1 * p <0.05 ** p <0.01 *** p <0.001 
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Table 3. Robustness check on teacher sorting 


Main Model w/ Other New w/ Other New 
Teachers Teachers and Grade- 
churning 


Model-1 Model-2  Model-1 Model-2 Model-1 Model-2 
Panel 1: “Linear-in-means” 


“Linear-in-means” 0.0107 0.020* 0.009 0.020* 0.011* 0.025* 
(0.006) (0.009) (0.006) (0.010) (0.005) (0.010) 
Students’ own (focal) teachers’ prior stable 0.068*** 0.069*** 0.069*** 0.069***  0.073***  0.072*** 
effectiveness (0.006) (0.006) (0.005) (0.006) (0.006) (0.007) 
Number of other new teachers (novice teachers or -0.006 -0.002 0.02 0.057" 
new transfer teachers without prior stable (0.017) (0.033) (0.019) (0.033) 
effectiveness) at the grade level 
Own teacher was in a different grade last year -0.009 -0.017 
(0.014) (0.015) 
N 109,422 109,422 109,422 109,422 76,107 76,107 
adj. R-sq 0.687 0.690 0.687 0.690 0.692 0.693 
Panel 2: “Relative effectiveness” 
“Relatively effective” 0.019*** 0.028*** 0.019*** 0.028*** 0.019** — 0.035*** 
(0.005) (0.008) (0.005) (0.008) (0.006) (0.009) 
“Relatively ineffective” 0.002 -0.009 0.002 -0.009 -0.001 -0.013 
(0.006) (0.008) (0.006) (0.008) (0.005) (0.009) 
F-statistics 3.77" 6.88** 3.658" 6.56** 7.79%" 10.14** 
Students’ own (focal) teachers’ prior stable 0.081*** 0.095*** 0.081***  0.095***  0.087*** 0.104*** 
effectiveness (0.008) (0.011) (0.008) (0.011) (0.008) (0.012) 
Number of other new teachers (novice teachers or -0.005 -0.002 0.019 0.0597 
new transfer teachers without prior stable (0.017) (0.033) (0.019) (0.033) 
effectiveness) at the grade level 
Own teacher was in a different grade last year -0.010 -0.017 
(0.014) (0.014) 
N 109,422 109,422 109,422 109,422 76,107 761,07 
adj. R-sq 0.687 0.690 0.687 0.690 0.692 0.693 
Panel 3: “Absolute effectiveness” 
“Linear-in-means” 0.010 0.021* 0.010 0.021* 0.012** 0.026* 
(0.006) (0.009) (0.006) (0.009) (0.005) (0.009) 
“Absolute effectiveness”: -0.006 -0.008* -0.006 -0.008* -0.006 -0.008 
New transfer peers x Focal teachers (0.004) (0.004) (0.004) (0.004) (0.004) (0.005) 
Students’ own (focal) teachers’ prior stable 0.069*** 0.069*** 0.069*** 0.069***  0.072***  0.071*** 
effectiveness (0.005) (0.005) (0.005) (0.005) (0.006) (0.007) 
Number of other new teachers (novice teachers or -0.006 -0.003 0.018 -0.058" 
new transfer teachers without prior stable (0.017) (0.033) (0.019) (0.033) 
effectiveness) at the grade level 
Own teacher was in a different grade last year -0.008 -0.016 
(0.014) (0.014) 
N 109,422 109,422 109,422 109,422 76,107 76,107 
adj. R-sq 0.687 0.690 0.687 0.690 0.692 0.693 


Note: data from 2003-04 to 2012-13. t) <0.1 * p <0.05 ** p <0.01 *** p <0.001 

All models include student and classroom covariates. Student covariates include eligibility for free or reduced price 
lunch (FRL), whether the student is an English language learner, the student’s race/ethnicity, gender, age, prior 
suspension, prior absence, and prior math and reading test scores. Classroom covariates include % of FRL, % of 
students are English language learners, % of Hispanic, % of African American, % of Asian, % of White, % of 
female, average age, average days of prior suspension, average days of prior absence, and the average and standard 
deviation of students’ prior math and reading test scores. 

Model-1 includes school-grade, year fixed effects; Model-2 includes school-year, grade fixed effects. 

Standard errors are included in the parentheses and clustered at the school-grade-year level. 
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Table 4. Estimated spillover effects of involuntary transfers 


“Linear-in-means”: Prior stable effectiveness of new 
transfer peers 


“Relatively effective”: Positive values of the difference 
in prior stable effectiveness between new transfer 
peers and focal teachers 

“Relatively ineffective”: Negative values of the 
difference in prior effectiveness between new transfers 
and focal teachers 

F-statistics for the difference between “relatively 
effective” and “relatively ineffective” 

“Absolute effectiveness”: Prior stable effectiveness of 
new transfer peers = Prior stable effectiveness of focal 
teachers 

Students’ own (focal) teachers’ prior effectiveness 


N 
adj. R-sq 
Note: data from 2019-10, 2010-11, 2011-12 


(1) 
0.015 
(0.017) 


0.038* 
(0.014) 
5,644 
0.663 


(2) 


0.031 
(0.022) 


-0.009 
(0.014) 


1.99 


0.066% 
(0.025) 
5,644 
0.663 


(3) 
0.019 
(0.018) 


-0.014 
(0.025) 


0.038* 
(0.014) 
5,644 
0.663 


All models include student, classroom, and school covariates. Student covariates include eligibility for free or 
reduced price lunch (FRL), whether the student is an English language learner, the student’s race/ethnicity, gender, 
age, prior suspension, prior absence, and prior math and reading test scores. Classroom and school covariates 
include % of FRL, % of students are English language learners, % of Hispanic, % of African American, % of 
Asian, % of White, % of female, average age, average days of prior suspension, average days of prior absence, and 


the average and standard deviation of students’ prior math and reading test scores. 


Grade and year fixed effects are included. 


Standard errors are included in the parentheses and clustered at school-grade-year level. 


¥ pp <0.1 * p <0.05 ** p <0.01 *** p <0.001 
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Table 5. Estimated grade-level spillover effects in elementary and secondary schools separately 


Elementary 
Model 1 Model2 Model 1 Model2 

“Linear-in- 0.021* 0.001 

means” (0.008) (0.018) 
“Relatively 0.026** 0.013 

effective” (0.008) (0.014) 
“Relatively -0.008 0.011 

ineffective” (0.010) (0.017) 
F-statistics 6.45* 0.01 
“Absolute 

effectiveness 

”: New 

transfer peers 

x Focal 

teachers 
Students’ own 0.090*** 0.091*** 0.112*** 0.091*** 

(focal) (0.008) (0.008) (0.011) (0.020) 

teachers’ prior 

effectiveness 
N 41,178 41,178 41,178 41,178 
adj. R-sq 0.666 0.671 0.666 0.671 


Note: data from 2003-04 to 2012-13. 


Model 1 Model 2 | Model 1 
0.022* 0.000 0.005 

(0.009) (0.018) (0.007) 
-0.010 -0.0127 

(0.006) (0.007) 

0.089***  0.088*** | 0.052*** 
(0.008) (0.008) (0.007) 
41,178 41,178 68,244 
0.666 0.671 0.698 


Middle 

Model2 Model 1 Model 2 
0.021+ 
(0.013) 

0.016* 0.029** 

(0.007) (0.010) 

0.006 -0.010 

(0.008) (0.011) 

0.94 4.50* 
0.054*** 0.064*** 0.082*** 
(0.007) (0.010) (0.015) 
68,244 68,244 68,244 
0.700 0.697 0.700 


Model 1 Model 2 
0.005 0.022+ 
(0.007) (0.012) 
-0.004 -0.006 
(0.004) (0.004) 
0.055***  (0.054*** 
(0.007) (0.007) 
68,244 68,244 
0.697 0.700 


All models include student and classroom covariates. Student covariates include eligibility for free or reduced price lunch (FRL), whether the student is an 
English language learner, the student’s race/ethnicity, gender, age, prior suspension, prior absence, and prior math and reading test scores. Classroom covariates 
include % of FRL, % of students are English language learners, % of Hispanic, % of African American, % of Asian, % of White, % of female, average age, 


average days of prior suspension, average days of prior absence, and the average and standard deviation of students’ prior math and reading test scores. 


Model-1 includes school-grade, year fixed effects; Model-2 includes school-year, grade fixed effects. 
Standard errors are included in the parentheses and clustered at the school-grade-year level. 


¥ pp <0.1 * p <0.05 ** p <0.01 *** p <0.001 
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Appendix A 


Table Al provides the estimates for the four types of spillover effects on students’ 
reading achievement. The different model specifications test the robustness of results to teacher 
selection issues as discussed in the section of “Teacher Sorting”. Overall, the “linear-in-means”, 
“relatively effective”, and “relatively ineffective” effects are indistinguishable from zero. This 
absence of spillover effects on reading test scores is anticipated for at least two reasons. First, 
teachers generally have a stronger influence on math than reading test scores, as identified in 
prior studies (e.g., Nye, Konstantopoulos, & Hedges, 2004) and also confirmed by our study in 
that the estimated effects of own teachers on reading test scores are only about one third to one 
half of the estimated effects of own teachers on math test scores. Second, one of our prior studies 
reveals a much stronger positive association of teachers’ instructional collaboration with their 
math value-added than reading value-added in M-DCPS (Author, 2015, pp.28-30). 

However, the “absolute effectiveness” model in reading shows similar inferences to those 
identified in math. With one standard deviation increase in own teachers’ prior effectiveness, the 
spillover effect from new transfer peers would decrease about 0.6 percent or 1 percent of one 
standard deviation of student test scores. In other words, new peer teachers matter less for 
students whose own teachers were relatively more effective, or equivalently, that they matter 
more for those students whose own teachers were less effective. 
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Table Al. Estimated spillover effects on reading test scores 


Main Model w/ Other New w/ Other New 
Teachers Teachers and Grade- 
churning 


Model-1 Model-2. Model-1 Model-2.  ~Model-1 Model-2 
Panel 1: “Linear-in-means” 


“Linear-in-means” -0.0077 0.003 -0.006 0.003 -0.002 0.003 
(0.004) (0.007) (0.004) (0.007) (0.004) (0.009) 
Students’ own (focal) teachers’ prior stable 0.028***  0.033***  0.028*** —0.033*** 0.034*** — 0.03 9*** 
effectiveness (0.004) (0.004) (0.004) (0.004) (0.004) (0.005) 
Number of other new teachers (novice teachers or 
new transfer teachers without prior stable -0.010 -0.011 -0.014 -0.026 
effectiveness) at the grade level (0.011) (0.013) (0.005) (0.018) 
Own teacher was in a different grade last year 0.010 0.014 
(0.008) (0.009) 
N 110,587 110,587 110,587 110,587 — 80,365 80,365 
adj. R-sq 0.660 0.662 0.660 0.662 0.656 0.657 
Panel 2: “Relative effectiveness” 
“Relatively effective” -0.002 0.008 -0.002 0.008 0.002 0.008 
(0.004) (0.005) (0.004) (0.005) (0.004) (0.007) 
“Relatively ineffective” 0.0077 0.006 0.007% 0.006 0.007 0.008 
(0.004) (0.005) (0.004) (0.007) (0.005) (0.009) 
F-statistics 2.35 0.02 2.29 0.03 0.54 0.00 
Students’ own (focal) teachers’ prior stable 0.023***  0.034***  0.023***  0.034*** = 0.032*** — 0.039*** 
effectiveness (0.005) (0.008) (0.005) (0.008) (0.006) (0.009) 
Number of other new teachers (novice teachers or 
new transfer teachers without prior stable -0.010 -0.011 -0.015 -0.027 
effectiveness) at the grade level (0.011) (0.013) (0.011) (0.008) 
Own teacher was in a different grade last year 0.010 0.014 
(0.008) (0.009) 
N 110,587 110,587 110,587 110,587 — 80,365 80,365 
adj. R-sq 0.660 0.662 0.660 0.662 0.656 0.657 
Panel 3: “Absolute effectiveness” 
“Linear-in-means” -0.006 0.002 -0.006 0.002 -0.003 0.002 
(0.004) (0.007) (0.004) (0.007) (0.004) (0.009) 
“Absolute effectiveness”: -0.006 — -0.010** — -0.006* —--0.010** + -0.006 ~—--0.013*** 
New transfer peers < Focal teachers (0.003) (0.003) (0.003) (0.003) (0.004) (0.004) 
Students’ own (focal) teachers’ prior stable 0.029*** 0.033***  0.029*** 0.033*** 0.035*** —0.039*** 
effectiveness (0.004) (0.004) (0.004) (0.0043 (0.004) (0.004) 
Number of other new teachers (novice teachers or 
new transfer teachers without prior stable -0.010 -0.009 -0.014 -0.024 
effectiveness) at the grade level (0.011) (0.013) (0.011) (0.009) 
Own teacher was in a different grade last year 0.010 0.014 
(0.008) (0.009) 
N 110,587 110,587 110,587 110,587 = 80,365 80,365 
adj. R-sq 0.660 0.662 0.660 0.662 0.656 0.657 


Note: data from 2003-04 to 2012-13. *p <0.1 * p <0.05 ** p <0.01 *** p <0.001 

All models include student and classroom covariates. Student covariates include eligibility for free or reduced price 
lunch (FRL), whether the student is an English language learner, the student’s race/ethnicity, gender, age, prior 
suspension, prior absence, and prior math and reading test scores. Classroom covariates include % of FRL, % of 
students are English language learners, % of Hispanic, % of African American, % of Asian, % of White, % of 
female, average age, average days of prior suspension, average days of prior absence, and the average and standard 
deviation of students’ prior math and reading test scores. 

Model-1 includes school-grade, year fixed effects; Model-2 includes school-year, grade fixed effects. 

Standard errors are included in the parentheses and clustered at the school-grade-year level. 
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Appendix B 


Equation (B1) describes the teacher value-added model, which predicts the achievement 
of student i, taught by teacher / in school s in year ¢ as a function of his/her prior achievement, 
time-invariant and time-varying student characteristics, classroom characteristics, time-varying 
school characteristics, and a teacher-by-year fixed effect. 


Aijjs= oto Aijst-7+ a2 Aginer by Xijstt YoCjst + y3Sse + Ojet + Sijst (B1) 


where Ajsr is the math or reading exam score of student i, taught by incumbent teacher /, in 
school s and year ¢. The test scores used to generate the value-added estimates are the scaled 
scores from the Florida Comprehensive Assessment Test (FCAT), standardized to have a mean 
of 0 and a standard deviation of 1 for each grade in each year. Superscripts of subjects are 
omitted for simplicity, but we estimate equation (B1) separately in math and reading. Ajes:- 
indicates this student’s prior year subject test score and Agen y indicate his or her prior year 
score in the other subject (e.g., if modeling math achievement, then reading would be the other 
subject). 

Xijs: iS a vector of student 7’s characteristics, including poverty status, whether the student 
is an English language learner or in special education programs, the student’s race, gender, and 
prior absence. Cjs:is a vector of classmates’ characteristics, such as percent of students eligible 
for subsidized lunch, percent of students who are English language learners or in special 
education programs, percent of Hispanic, African American, Asian, and other students, percent 
of female students, average prior scores in math and reading, and average days absent. Ss: is a 
vector of school characteristics, including percent of students eligible for subsidized lunch, 
percent of students who are English language learners or in special education programs, percent 
of Hispanic, African American, Asian, and other students, percent of female students, average 
prior scores in math and reading, and average days absent. 

0: reflects the contribution of a given teacher to student achievement in each year, after 
controlling for all observed time-varying student, classroom, and school characteristics, and 
time-invariant student characteristics that may be associated with learning. Since we use the 
grades (3 to 8) as our reference groups, the estimates also indicate a teacher’s deviation from the 
average teacher in the grade. 

After estimating equation (B1), we shrink the teacher-by-year fixed effect estimates using 
the empirical Bayes methods to adjust for sampling and measurement errors and bring imprecise 
estimates closer to the mean (see Author, 2012, for a description of the shrinking). After 
shrinking the value-added estimates, we standardize them to have a mean of 0 and a standard 
deviation of | in each year to facilitate interpretation. 
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Appendix C 


To test the stability of grade-level spillover patterns on students’ math achievement, we 
expand the definition of peers to all teachers who taught in the same school in the same year. We 
anticipate similar directions of spillover effects, but a decrease in the magnitude from the grade- 
level effects, because a teacher shares less of the common production processes and direct 
interactions with peers schoolwide than those taught the same grade. As aforementioned, the 
identification strategy for schoolwide spillover may suffer from school-year specific shocks that 
cannot be captured by controlling for student characteristics and their school-average 
performance. We thus alert the readers to the weaker internal validity. 

The “linear-in-means” effects remain positive but insignificant, as shown in Table C1. 
The magnitude of “linear-in-means” effects drops to 0.003 from 0.006 of the corresponding 
grade-level estimates. The “relatively effective” estimate is consistently significantly positive, 
with an estimate of 1.7 percent of standard deviation increase in student achievement if the prior 
stable performance of new transfers is one standard deviation higher than that of own teacher. 
The “relatively ineffective” estimate is 0.004 and insignificant, similar as grade-level estimates. 
The “absolute effectiveness” estimate is -0.003, but insignificant. Overall, the school level 
models give somewhat similar effects even with some potential bias. 

Salient disparities of teacher spillovers are observed between elementary and middle 
grades. As shown in Table C2, the “linear-in-means” estimate at elementary grades is about 
0.015, significant at the 0.05 level, while the middle school estimates are close to zero and not 
statistically significant. Similarly, the “relatively effective” estimate for elementary grades is 
0.024 and statistically significant, while the estimates for middle school are, again, small and not 
distinguishable from zero. Given the worries with bias in the school level model, we do not put 
too much weight on these differences. 
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Table C1. Estimated school-level spillover effects 


(1) (2) (3) 

“Linear-in-means”: Prior stable effectiveness of new 0.003 0.003 

transfer peers (0.005) (0.005) 
“Relatively effective”: Positive values of the difference in 0.017* 

prior stable effectiveness between new transfer peers and (0.007) 

focal teachers 
“Relatively ineffective”: Negative values of the difference in 0.004 

prior effectiveness between new transfers and focal (0.006) 

teachers 
F-statistics for the difference between “relatively effective” 1.95 

and “relatively ineffective” 
“Absolute effectiveness”: Prior stable effectiveness of new -0.003 

transfer peers < Prior stable effectiveness of focal teachers (0.003) 
Students’ own (focal) teachers’ prior effectiveness 0.102*** 0.101*** 0.106*** 

(0.004) (0.004) (0.006) 

N 320,715 320,715 320,715 
adj. R-sq 0.666 0.666 0.666 


Note: data from 2003-04 to 2012-13 

All models include student, classroom, and school covariates. Student covariates include eligibility for free or 
reduced price lunch (FRL), whether the student is an English language learner, the student’s race/ethnicity, gender, 
age, prior suspension, prior absence, and prior math and reading test scores. Classroom and school covariates 
include % of FRL, % of students are English language learners, % of Hispanic students, % of African American, % 
of Asian, % of White, % of female students, average age, average days of suspension, average days of absence, and 
the average and standard deviation of students’ prior math and reading test scores. 

Grade and year fixed effects are included.. 

Standard errors are included in the parentheses and clustered at the school- year level. 


¥ » <0.1 * p <0.05 ** p <0.01 *** p <0.001 
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Table C2. Estimated school-level spillover effects in elementary and secondary schools 


separately 


Elementary 


0.015* 
(0.006) 


0.015* 
(0.006) 


“Linear-in-means”: Prior stable 
effectiveness of new transfer 
peers 

“Relatively effective”: Positive 
values of the difference in prior 
stable effectiveness between 
new transfer peers and focal 
teachers 

“Relatively ineffective”: Negative 
values of the difference in prior 
effectiveness between new 
transfers and focal teachers 

F-statistics for the difference 
between “relatively effective” 
and “relatively ineffective” 

“Absolute effectiveness”: Prior 
stable effectiveness of new 
transfer peers < Prior stable 
effectiveness of focal teachers 

Students’ own (focal) teachers’ 0.121*** 
prior effectiveness (0.006) 

N 129,915 


0.642 


0.024* 
(0.010) 


-0.010 
(0.008) 


TAS** 


-0.005 
(0.004) 


0.136*** 
(0.008) 
129,915 


0.642 


0.120*** 
(0.006) 


129,915 
adj. R-sq 0.642 


Note: data from 2003-04 to 2012-13 


Middle 
-0.002 -0.002 
(0.005) (0.005) 

0.006 

(0.008) 

0.005 

(0.006) 

0.00 
-0.003 
(0.003) 
0.079*** 0.078*** 0.078*** 
(0.005) (0.007) (0.005) 
190,800 190,800 190,800 
0.684 0.684 0.684 


All models include student, classroom, and school covariates. Student covariates include eligibility for free or 
reduced price lunch (FRL), whether the student is an English language learner, the student’s race/ethnicity, gender, 
age, prior suspension, prior absence, and prior math and reading test scores. Classroom and school covariates 
include % of FRL, % of students are English language learners, % of Hispanic students, % of African American, % 
of Asian, % of White, % of female students, average age, average days of suspension, average days of absence, and 
the average and standard deviation of students’ prior math and reading test scores. 


Grade and year fixed effects are included. 


Standard errors are included in the parentheses and clustered at the school- year level. 


¥ p<0.1 * p <0.05 ** p <0.01 *** p <0.001 
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